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Abstract 

Conditional logics play an important role in recent attempts to formulate theories 
of default reasoning. This paper investigates first-order conditional logic. We show 
that, as for first-order probabilistic logic, it is important not to confound statistical 
conditionals over the domain (such as “most birds fly”)) and subjective conditionals 
over possible worlds (such as “I believe that Tweety is unlikely to fly”). We then address 
the issue of ascribing semantics to first-order conditional logic. As in the propositional 
case, there are many possible semantics. To study the problem in a coherent way, we use 
plausibility struetures. These provide us with a general framework in which many of the 
standard approaches can be embedded. We show that while these standard approaches 
are all the same at the propositional level, they are significantly different in the context 
of a first-order language. Furthermore, we show that plausibilities provide the most 
natural extension of conditional logic to the first-order case: We provide a sound and 
complete axiomatization that contains only the KLM properties and standard axioms 
of first-order modal logic. We show that most of the other approaches have additional 
properties, which result in an inappropriate treatment of an infinitary version of the 
lottery paradox. 


*A preliminary version of this paper appears in Proc. National Conference on Artificial Intelligence 
(AAAI ’96), 1996, pp. 1305-1312. Some of this work was done while all three authors were at the IBM 
Almaden Research Center, supported by the Air Force Office of Scientific Research (AFSC) under Contract 
F49620-91-C-0080; some was done while Daphne Roller was at U.C. Berkeley, supported by a University 
of California President’s Postdoctoral Fellowship; and some was done while Nir Friedman was at Stanford 
University. This work was also partially supported by NSF grants IRI-95-03109 and IRI-96-25901. 



1 Introduction 


In recent years, conditional logic has come to play a major role as an underlying foundation 
for default reasoning. Two proposals that have received a lot of attention ||Geffner 1992 ; 
Goldszmidt, Morris, and Pearl 19^^ are based on conditional logic. Unfortunately, while it 


has long been recognized that hrst-order expressive power is necessary for a default reasoning 
system, most of the work on conditional logic has been restricted to the propositional case. 
In this paper, we investigate the syntax and semantics of first-order conditional logic, with 
the ultimate goal of providing a hrst-order default reasoning system. 

Many seemingly different approaches have been proposed for giving semantics to condi¬ 
tional logic, including preferential structures fLewis 1973| ; [Boutilier 1994| ; [Kraus, Lehmann" 


and Magidor 1990|| , e-semantics [[Adams 1975| ; [Pearl 1989[| , possibility theory jpubois and| 


Prade 199 1[|, and n-rankings |[Spohn 1988[; [Goldszmidt and Pearl 1992[|. In preferential 


structures, for example, a model consists of a set of possible worlds, ordered by a preference 
ordering -<. If tc -< w', then the world w is strictly more preferred/more normal than w'. 
The formula Bird—>Fly holds if in the most preferred worlds in which Bird holds. Fly also 
holds. (See Section ^ for more details about this and the other approaches.) 

The extension of these approaches to the hrst-order case seems deceptively easy. After all, 
we can simply have a preference ordering on hrst-order, rather than propositional, worlds. 
However, there is a subtlety here. As in the case of hrst-order probabilistic logic [[Bacchus 
1990; [Halpern 1990[, there are two distinct ways to dehne conditionals in the hrst-order case. 


In the probabilistic case, the hrst corresponds to (objective) statistical statements, such as 
“90% of birds hy”. The second corresponds to subjective degree of belief statements, such as 
“the probability that Tweety (a particular bird) hies is 0.9”. The hrst is captured by putting 
a probability distribution over the domain (so that the probability of the set of hying birds 
is 0.9 that of the set of birds), while the second is captured by putting a probability on the 
set of possible worlds (so that the probability of the set of worlds where Tweety hies is 0.9 
that of the set of worlds where Tweety is a bird). The same phenomenon occurs in the case 
of hrst-order conditional logic. Here, we can have a measure (e.g., a preference order) over 
the domain, or a measure over the set of possible worlds. The hrst would allow us to capture 
qualitative statistical statements such as “most birds hy”, while the second would allow us 
to capture subjective beliefs such as “I believe that the bird Tweety is likely to hy”. It is 
important to have a language that allows us to distinguish between these two very diherent 
statements. Having distinguished between these two types of conditionals, we can ascribe 
semantics to each of them using any one of the standard approaches. 

There have been previous attempts to formalize hrst-order conditional logic; some are 
the natural extension of some propositional formalism [Pelgrande 1987[; Brafman 1997[|, 


while others use alternative approaches [[Lehmann and Magidor 1990[ [jchlechta 1995|[ . (See 
Sections ^ H, and ^ for a more detailed discussion of the alternative approaches.) How do we 


make sense of this plethora of alternatives? Rather than investigating them separately, we use 
a single common framework that generalizes almost all of them. This framework uses a notion 
of uncertainty called a plausibility measure, introduced by Friedman and Halpern |[1995|| . A 


1 













































plausibility measure associates with set of worlds its plausibility, which is just an element in a 
partially ordered space. Probability measures are a subclass of plausibility measures, in which 
the plausibilities he in [0,1], with the standard ordering. Friedman and Halpern ||1998|| show 
that the different standard approaches to conditional logic can all be mapped to plausibility 
measures, if we interpret Bird—>Fly as “the set of worlds where Bird A Fly holds has greater 
plausibility than that of the set of worlds where Bird A ^Fly holds”. 

The existence of a single unifying framework has already proved to be very useful in the 
case of propositional conditional logic. In particular, it allowed Friedman and Halpern ||1998 
to explain the intriguing “coincidence” that all of the different approaches to conditional 


logic result in an identical reasoning system, characterized by the KLM postulates [[Kraus, 


[Lehmann, and Magidor 19^ . In this paper, we show that plausibility spaces can also be used 


to clarify the semantics of hrst-order conditional logic. However, we show that, unlike the 
propositional case, the different approaches lead to different properties in the hrst-order case. 
Of course, these are properties that require quantihers and therefore cannot be expressed 
in a propositional language. We show that, in some sense, plausibilities provide the most 
natural extension of conditional logic to the hrst-order case. We provide sound and complete 
axiomatizations for both the subjective and statistical variants of hrst-order conditional logic 
that contain only the KLM properties and the standard axioms of hrst-order modal logic.Q 
Essentially the same axiomatizations are shown to be sound and complete for the hrst-order 
version of e-semantics, but the other approaches are shown to satisfy additional properties. 

One might think that it is not so bad for a conditional logic to satisfy additional proper¬ 
ties. After all, there are some properties—such as indiherence to irrelevant information—that 
we would like to be able to get. Unfortunately, the additional properties that we get from 
using these approaches are not the ones we want. The properties we get are related to the 
treatment of exceptional individuals. This issue is perhaps best illustrated by the lottery 
paradox [[Kyburg 1961[| .p[ Suppose we believe about a lottery that any particular individual 
typically does not win the lottery. Thus we get 


Vx(f™e—»--i Winner{x)). 

However, we believe that typically someone does win the lottery, that is 

true—Winner {x). 


( 1 ) 


( 2 ) 


Let Lottery be the conjunction of (|l]) and (@). 

^By way of contrast, there is no (recursively enumerable) axiomatization of of either statistical or subjec¬ 
tive first-order probabilistic logic; the validity problem for these logics is highly undecidable (11^ complete) 
[Abadi and Halpern 1994]. 


^We are referring to Kyburg’s original version of the lottery paradox [Kyburg 1961 [, and not to the 
Unitary version discussed by Poole [1991 [. As Poole showed, any logic of defaults that satisfies certain 


minimal properties—properties which are satisfied by all the logics we consider—is bound to suffer from his 
version of the lottery paradox. 


2 















Unfortunately, in many of the standard approaches, such as Delgrande’s 
of hrst-order preferential structures, from (|I]) we can conclude 




version 


true^W Winner {x)). 


(3) 


Intuitively, from (|l|) it follows that in the most preferred worlds, each individual d does not 
win the lottery. Therefore, in the most preferred worlds, no individual wins. This is exactly 
what @ says. Since (|^) says that in the most preferred worlds, some individual wins, it 
follows that there are no most preferred worlds, i.e., we have true — >false. While this may 
be consistent (as it is in Delgrande’s logic), it implies that all defaults hold, which is surely 
not what we want. Of all the approaches, only e-semantics and plausibility structures, both 
of which are fully axiomatized by the hrst-order extension of the KLM axioms, do not suffer 
from this problem. 

It may seem that this problem is perhaps not so serious. After all, how often do we reason 
about lotteries? But, in fact, this problem arises in many situations which are clearly of the 
type with which we would like to deal. Assume, for example, that we express the default 
“birds typically hy” as Delgrande does, using the statement 


W X {Bird {x)^ Fly (x)). 


(4) 


Suppose we also believe that Tweety is a bird that does not hy. There are a number of ways we 
can capture beliefs in conditional logic. The most standard [[Friedman and Halpern 1997|| is to 
identify belief in ip with ip typically being true, that is, with true — >p. Using this approach, 
our knowledge base would contain the statement true^ Bird {Tweety) A ^Fly{Tweety)we 
could similarly conclude true — >false. Again, this is surely not what we want. 

Our framework allows us to deal with these problems. Using plausibilities. Lottery does 
not not imply true — >false, since does not follow from (0)- That is, the lottery paradox 
simply does not exist if we use plausibilities. The hying bird example is somewhat more 
subtle. If we take Tweety to be a nonrigid designator (so that it might denote diherent indi¬ 
viduals in diherent worlds), the two statements are consistent, and the problem disappears. 
If, however, Tweety is a rigid designator, the pair is inconsistent, as we would expect.]^ 

This inconsistency suggests that we might not always want to use (|[) to represent “birds 
typically hy”. After all, the former is a statement about a property believed to hold of 
each individual bird, while the latter is a statement about the class of birds. As argued 
in Pacchus, Grove, Halpern, and Koller 1996[| , defaults often arise from statistical facts about 
the domain. That is, the default “birds typically hy” is often a consequence of the empirical 
observation that “almost all birds hy”. By dehning a logic which allows us to express 
statistical conditional statements, we provide the user an alternative way of representing such 
defaults. We would, of course, like such statements to impact our beliefs about individual 
birds. In [[Bacchus, Grove, Halpern, and Koller 199^ , the same issue was addressed in the 

•^To see this, note that if Tweety is a rigid designator, then Bird{Tweety)—> Fly {Tweety) is a consequence 
of (^). See the discussion in Section^ for more details on this point. 
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probabilistic context, by presenting an approach for going from statistical knowledge bases 
to subjective degrees of belief. We leave the problem of providing a similar mechanism for 
conditional logic to future work. 


The rest of this paper is organized as follows. In Section ^ we review the various ap¬ 
proaches to conditional logic in the propositional case; we also review the dehnition of plau¬ 
sibility measures from [[Friedman and Halpern 1998|| and show how they provide a common 
framework for these different approaches. In the next three sections, we focus on hrst-order 
subjective conditional logic. In Section |^, we describe the syntax for the language and as¬ 
cribe semantics to formulas using plausibility. In Section we provide a sound and complete 
axiomatization for hrst-order subjective conditional assertions. In Section we discuss the 
generalization of the other propositional approaches to the hrst-order subjective case, by 
investigating their behavior with respect to the lottery paradox. We also provide a brief 
comparison to some of the other approaches suggested in the literature. In Sections 1,0, 
and 1^, we go through the same exercise for hrst-order statistical conditional logic, describing 
the syntax and semantics, providing a complete axiomatization, and comparing to other 
approaches. We conclude in Section 0 with discussion and some directions for further work. 


2 Propositional conditional logic 


The syntax of propositional conditional logic is simple. We start with a set <h of propositions 
and close oh under the usual propositional connectives (-,, V, A, and denoting, negation, 
disjunction, conjunction and material implication, respectively) and the conditional connec¬ 
tive —That is, if ip and -0 are formulas in the language, so are 9 ? V 0, 93 A 0, -0, 

and p —>0. 


Many semantics have been proposed in the literature for conditionals. Most of them 
involve structures of the form (W, X, tt), where W is a set of possible worlds, 7r(tc) is a truth 
assignment to primitive propositions, and X is some “measure” on W such as a preference 
ordering [[Lewis 1973] ; [Kraus, Lehmann, and Magidor 1990[] .P[ We now describe some of the 
proposals in the literature, and then show how they can be generalized. Given a structure 
(IT, X, tt), let [</9] C IT be the set of worlds satisfying p. 


A possibility measure [Pubois and Trade 1990[[ Poss is a function Poss : 2^ 1 —[0,1] 
such that Poss(lT) = 1, Poss(0) = 0, and Poss(A) = sup^g^(Poss({tc}). A possibility 
structure is a tuple (IT, Poss, vr), where Poss is a possibility measure on IT. It satishes 
a conditional p — >' 1 !^ if either Poss([</9]) = 0 or Poss([</9 A 0]) > Poss([</9 A “ 10 ]) [Pubois 
and Prade 199l[] . That is, either p is impossible, in which case the conditional holds 
vacuously, or (p A 0 is more possible than p A -,0. 


"‘We could also consider a more general definition, in which one associates a different “measure” with 
each world, as done by Lewis [ 1973 [. It is straightforward to extend our definitions to handle this. Since this 
issue is orthogonal to the main point of the paper, we do not discuss it further here. 
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A K-ranking (or ordinal ranking) on W (as defined by |Goldszniidt and Pearl 1992 


based on ideas that go back to ||Spohn 1988|| ) is a function k : 2^ —> IV*, where 
IV* = IV U {oo}, such that k(W) = 0, fi:(0) = oo, and n,{A) = niin^^,g^ KM))- 
Intuitively, an ordinal ranking assigns a degree of surprise to each subset of worlds 
in W, where 0 means unsurprising and higher numbers denote greater surprise. A 
K-structure is a tuple {W,k,7i), where k is an ordinal ranking on W. It satishes a 
conditional if either ^([ 9 ?]) = cxo or ^([(^9 A '0]) < A “''0]). 

A preference ordering on fh is a partial order -< over W ||Kraus, Lehmann, and Magidoij 
19901; IShoham 1987|] . Intuitively, w -< w' holds if w is preferred to w'. A preferential 


structure is a tuple (hh, tt), where -< is a partial order on W. The intuition ||Shoham| 
198711 is that a preferential structure satishes a conditional (p —>0 if all the most pre¬ 
ferred worlds (i.e., the minimal worlds according to -<) in [<^ 9 ] satisfy 0. However, 
there may be no minimal worlds in [<^ 9 ]. This can happen if [<^ 9 ] contains an inhnite 
descending sequence ... -< w 2 -< Wi. What do we do in these structures? There are 
a number of options: the hrst is to assume that, there are no inhnite descending se¬ 
quences, i.e., that -< is well-founded] this is essentially the assumption made by Kraus, 
Lehmann, and Magidor ||1990|| .p| A yet more general dehnition—one that works even 
if -< is not well-founded—is given in ||Lewis 197^ ; [Boutilier 1994|| . Roughly speaking, 
99 — >0 is true if, from a certain point on, whenever p is true, so is 0. More formally. 


(IT,-<, 7 r) satishes 99 —>0 if, for every world wi G [(^], there is a world W 2 
such that (a) W2 ^ Wi (i.e., either W2 -< Wi or W2 = Wi) (b) W2 G [(^ A 0], 
and (c) for all worlds w^, -< W 2 , we have rcs G 0] (so any world more 

preferred than W 2 that satishes p also satishes 0). 


It is easy to verify that this dehnition is equivalent to the earlier one if -< is well 
founded. 


A parameterized probability distribution (PPD) on hP is a sequence {Pr* : i > 0} 
of probability measures over W. A PPD structure is a tuple (hP, {Pr* : i > 0},7r), 
where {Pr*} is PPD over W. Intuitively, it satishes a conditional p —>0 if the con¬ 
ditional probability 0 given p goes to 1 in the limit. Formally, p — ^0 is satished if 
hmi^ooPD([0llM) = 1 (where Pri([0]]|[(^]) is taken to be 1 if Pr0[99]) = 0). PPD 
structures were introduced in ||Goldszmidt, Morris, and Pearl 1993|] as a reformulation 
of Pearl’s e-semantics llPearl 1989 . 


These variants are quite diherent from each other. As Friedman and Halpern |[I998|| show, 
we can provide a uniform framework for all of them using the notion of plausibility measures. 


A plausibility measure PI on IP is a function that maps subsets of IP to elements in 
some arbitrary partially ordered set. We read P1(A) as “the plausibility of set A”. If 


^Actually, they make a weaker assumption, called smoothness, that for each formula y, there are minimal 
worlds in i.e., that -< is well-founded on the sets of interest. All the results we prove for well-founded 
preferential structures hold for smooth ones as well. 
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P1(>1) < P1(S), then B is at least as plausible as A. Formally, a plausibility space is a tuple 
S = (hF, JF, PI), where hF is a set of worlds, T is an algebra of subsets of W (that is, a set 
of subsets closed under union and complementation), and PI maps the sets in T to some set 
D, partially ordered by a relation < (so that < is reflexive, transitive, and anti-symmetric). 
To simplify notation, we typically omit the algebra T from the description of the plausibility 
space. As usual, we dehne the ordering < by taking di < d 2 if di < d 2 and di ^ d 2 - 
We assume that D is pointed', that is, it contains two special elements T and _L such that 
T< d <T for all d & D] 'we further assume that Pl(fF) = T and P1(0) =_L. Since we want 
a set to be at least as plausible as any of its subsets, we require: 

Al. If A C S, then P1(A) < P1(S). 


Clearly, plausibility spaces generalize probability spaces. Other approaches to dealing 
with uncertainty, such as possibility measures, K-rankings, and belief functions [[Shafer 1976|| , 
are also easily seen to be plausibility measures. 


We can give semantics to conditionals using plausibility in much the same way as it is 
done using possibility. A plausibility structure is a tuple PL = (IF,Pl, 7 r), where PI is a 
plausibility measure on W. We then dehne: 


• PL 1= (p^ip if either Pl([(p]) =_L or Pl([(p A ^p]) > Pl([(p A “’■?/’]). 


Intuitively, ip —holds vacuously if p is impossible; otherwise, it holds if (p A V’ is rnore 
plausible than p A -i^’. It is easy to see that this semantics for conditionals generalizes 
the semantics of conditionals in possibility structures and ^-structures. We are implicitly 
assuming here that [ 93 ] is in JF (i.e., in the domain of PI) for each formula p. 

As shown in [[Friedman and Halpern 1998| , Theorem 4.2], it also generalizes the semantics 
of conditionals in preferential structures and PPD structures. More precisely, a mapping is 
given from preferential structures (resp., PPD structures) to plausibility structures such that 
the semantics of defaults are preserved. For future reference, we sketch these constructions 
here. 


For PPDs, it is quite straightforward. Given a PPD PP = (Pri, Pr 2 ,...) on a space W, 
we can dehne a plausibility measure Plpp such that Plpp(A) < Plpp(i?) ihlimj^oo Pri(i?|AU 
B) = 1. It can then be shown that ((IF, PP, tt), tc) |= p ih ((IF, Plpp, tt), w) |= p for all 
w E W and interpretations tt. 

The mapping of preferential structures into plausibility structures is slightly more com¬ 
plex. Suppose we are given a preferential structure (IF,-<, 7 r). Let Dq be the domain of 
plausibility values consisting of one element d^ for every element w G IF. We use -< to 
determine the order of these elements: dy < dw if w -< v. (Recall that w w' denotes that 
w is preferred to w'.) We then take D to be the smallest set containing Dq closed under 
least upper bounds (so that every set of elements in D has a least upper bound in D). It is 
not hard to show that D is well-dehned (i.e., there is a unique, up to renaming, smallest set) 
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and that taking Pl^(74) to be the least npper bonnd of {d^ w & A} gives ns the following 
property: 

Pl^(kl) < Pl^(-B) if and only if for all ta G A — B, there is a world 

w' & B snch that w' -< w and there is no w" G A — B snch that (5) 

w” -< w'. 


It is then easy to check that {{W, ~<,7i),w) \= ip ii and only if ((IP, Pl_<, vr), w) |= <p, for all 
w G IP and interpretations tt. 


These resnlts show that onr semantics for conditionals in plansibility strnctnres general¬ 
izes the varions approaches examined in the literatnre. Does it captnre onr intnitions abont 
conditionals? In the AI literatnre, there has been discnssion of the right properties of de- 
fanlt statements (which are essentially conditionals). While there has been little consensns 
on what the “right” properties for defanlts shonld be, there has been some consensns on a 
reasonable “core” of inference rnles for defanlt reasoning. This core, is known as the KLM 
properties |[Krans, Lehmann, and Magidor 1990 


We briefly list these properties here: 


LLE. If h (p (p'Q, then from p —infer p' — y'lp 
RW. If h "0 ip', then from p —infer p — 
REF. p—>p 

AND. From p — y'lpi and p — > 1^2 infer p —A 'ip 2 
OR. From pi—ypj and p 2 —infer pi V p 2 —^'ip 
CM. From p—ypJi and p — >'ip 2 infer p A '02—^01 


(Left Logical Eqnivalence) 
(Right Weakening) 
(Reflexivity) 
(And) 
(Or) 

(Cantions Monotonicity) 


LLE states that the syntactic form of the antecedent is irrelevant. Thus, if pi and p 2 
are equivalent, we can deduce P 2 ^'ip from (pi—>0. RW describes a similar property of 
the consequent: If 0 (logically) entails ip', then we can deduce p —^0' from p —^0. This 
allows us to can combine default and logical reasoning. REF states that p is always a 
default conclusion of p. AND states that we can combine two default conclusions: If we can 
conclude by default both 0i and 02 from p, then we can also conclude 0i A 02 from p. OR 
states that we are allowed to reason by cases: If the same default conclusion follows from 
each of two antecedents, then it also follows from their disjunction. CM states that if 0i and 
02 are two default conclusions of p, then discovering that 02 holds when p holds (as would 
be expected, given the default) should not cause us to retract the default conclusion 0i. 

Do conditionals in plausibility structures satisfy the KLM properties? They always satisfy 
REF, LLE, and RW, but they do not in general satisfy AND, OR, and CM. To satisfy 
the KLM properties we must limit our attention to plausibility structures that satisfy the 
following two conditions: 


A2. If A, R, and C are pairwise disjoint sets, P1(A U R) > P1(C), and P1(A U C) > P1(R), 
then P1(A) > P^RUC). 

®where h denotes provability in propositional logic 
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A3. If P1(A) = P1(S) =A, then P1(A U B) =A. 


A plausibility space (IP, PI) is qualitative if it satisfies A2 and A3. A plausibility struc¬ 
ture (IP, PI, tt) is qualitative if (IP, PI) is a qualitative plausibility space. Friedman and 
Halpern ||1998|| show that, in a very general sense, qualitative plausibility structures capture 
default reasoning. More precisely, the KLM properties are sound with respect to a class 
of plausibility structures if and only if the class consists of qualitative plausibility struc¬ 
tures. Furthermore, a very weak condition is necessary and sufficient in order for the KLM 
properties to be a complete axiomatization of conditional logic. As a consequence, once we 
consider a class of structures where the KLM axioms are sound, it is almost inevitable that 
they will also be complete with respect to that class. This explains the somewhat surprising 
fact that KLM properties characterize default entailment not just in preferential structures, 
but also in e-semantics, possibility measures, and ^-rankings. Each one of these approaches 
corresponds, in a precise sense, to a class of qualitative plausibility structures. These results 
show that plausibility structures provide a unifying framework for the characterization of 
default entailment in these different logics. 


3 First-order subjective conditional logic 


We now want to generalize conditional logic to the first-order case. As mentioned above, 
there are two distinct notions of conditionals in first-order logic, one involving statistical 
conditionals and one involving subjective conditionals. For each of these, we use a different 
syntax, analogous to the syntax used in [[Halpern 199U|| for the probabilistic case. In the 
next three sections, we focus on the subjective case; in Sections B0, and p, we consider the 
statistical case. 


The syntax for subjective conditional logic is fairly straightforward. Let <I> be a first-order 
vocabulary, consisting of predicate and function symbols. (As usual, constant symbols are 
viewed as 0-ary function symbols.) Starting with atomic formulas of first-order logic, we form 
more complicated formulas by closing off under truth-functional connectives (i.e.. A, V, 
and =^), first-order quantification, and the modal operator —>■. Thus, a typical formula is 
'ix{P{x)^3y{Q{x,y)^R{y))). Let £'^“^-^($) be the resulting language (the “subj” stands 
for “subjective”, since the conditionals are viewed as expressing subjective degrees of belief). 
We typically omit the <1> if it is clear from context or irrelevant. 

We can ascribe semantics to subjective conditionals using any one of the approaches 
described in the previous section. However, since we can embed all of the approaches within 
the class of plausibility structures, we use these as the basic semantics. As in the propositional 
case, we can then analyze the behavior of the other approaches simply by restricting attention 
to the appropriate subclass of plausibility structures. 

To give semantics to £^“^-^(<F), we use (first-order) subjective plausibility structures over 
<h. These are tuples of the form PL = (T>om, IF, PI, tt), where Dom is a domain, (IF, PI) is 
a plausibility space and 7i{w) is an interpretation assigning to each predicate symbol and 






function symbol in $ a predicate or function of the right arity over Dom. As usual, a 
valuation maps each variable to an element of Dom. We dehne the set of worlds that satisfy 
(p given the valuation v to be = {w : {PL,w,v) |= (p}. (We omit the subscript 

whenever it is clear from context.) For subjective conditionals, we have 

• {PL,w,v)\= if P1 ([v7](pl,^)) =± or Pl([v7 A'0](pp,„)) > Pl([(y9 A ^'0](pp,„)). 

The semantics of atomic formulas and quantihers is the same as in hrst-order logic. As an 
example, for the atomic formula P{x, c), we have 

• {PL, w, v) \= P{x, c) if {v{x), 7i{w){c)) G 7i{w){P). 

Note that n{w){c) is the interpretation of the constant c in the world w. There may be 
a different interpretation of c in each world; that is, we may have 7t{w){c) ^ 7r(tc')(c) if 
w 7 ^ vj'. Thus, c is nonrigid. We return to this issue below. Similarly, 7i{w){P) is the 
interpretation of P in w. 

To give the semantics of quantihcation, it is useful to dehne a family of equivalence 
relations on valuations, where X is a set of variables. We write v n' if v and v' agree 
on the values they give to all variables except possibly those in X. If X is the singleton {x}, 
we write instead of 

• {PL, w, v) 1= \/x(p if (PI, w, v') 1= (p for all valuations v' snch that v' v. 


Becanse terms are not rigid designators, we cannot snbstitnte terms for nniversally qnan- 
tihed variables. (A similar phenomenon holds in other modal logics where terms are not 
rigid IIGarson 1977|| .) For example, let Nip be an abbreviation for -ip — >■ false. Notice that 
{PL,w) \= Np if Pl([-i93]) =T; i.e., Np asserts that the plansibility of -^p is the same as 
that of the empty set, so that p is trne “almost everywhere” .[] Snppose c is a constant that 
does not appear in the formnla p. It is not hard to see that 'ix{-^Np{x)) {-^Np{c)) is 

not valid in onr framework; that is, we cannot snbstitnte constants for nniversally qnan- 
tihed variables. To see this, let p{x) be the formnla P{x), where P is a nnary predicate. 
Consider the plansibility strnctnre PL = ({di, ^ 2 }, 1 ^ 2 }, PI, tt), where tt is snch that c 

is di in world Wi and ^2 in world W 2 , the extension of P in wi is {di} and the extension 
of P in W 2 is {d 2 }, and PI is snch that Pl({wi}) = Pl({tC 2 } t^T. It is easy to see that 
{PL, wi) h ^x{-^NP{x)) A NP{c). 

We could substitute c for x in \/xp{x) if c were rigid. We can get the effect of rigidity by 
assnming that 3x{N{x = c)) holds. Thus, we do not lose expressive power by not assnming 
rigidity. 


As in hrst-order logic, a sentence is a formnla with no free variables. It is easy to check 
that, just as in hrst-order logic, the trnth of a sentence is independent of the valnation. 
Thns, if (p is a sentence, we often write {PL,w) |= p rather than {PL,w,v) \= p. 

stands for “necessary”. 
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4 Axiomatizing first-order subjective conditional logic 


We now want to show that plausibility structures provide an appropriate semantics for a 
hrst-order logic of defaults. As in the propositional case, this is true only if we restrict 
attention to qualitative plausibility structures, i.e., those satisfying conditions A2 and A3 
above. Let be the class of all subjective qualitative plausibility structures. We provide 

a sound and complete axiom system for , and show that it is the natural extension of 
the KLM properties to the hrst-order case. 


The system consists of all generalizations of the following axioms (where is a 

generalization of if (p is of the form \/xi... 'ixn'ip) and rules. In the axioms x and y 
denote variables, while t denotes an arbitrary term. consists of three parts. The 

hrst set of axioms (C0-C5 together with the rules MP, Rl, and R2) is simply the standard 
axiomatization of propositional conditional logic [[Hughes and Cresswell 1968|| ; the second 
set (axioms F1-F5) consists of the standard axioms of hrst-order logic [jEnderton 1972|| . the 


hnal set (F6-F7) contains standard axioms relating the two [[Hughes and Cresswell 1968 


These axioms describe the interaction between N and equality, and hold because we are 
essentially treating variables as rigid designators. 


CO. 

Cl. 

C2. 

C3. 

C4. 

C5. 

FI. 

F2. 

F3. 

F4. 

F5. 

F6. 

F7. 

MP. 

Rl. 

R2. 


All instances of propositional tautologies 




((V^^^i) A (V7^^2)) ^ A -02)) 

a {^2^'tp)) ^ {{^1 V V?2)^^) 

(((y9i^V?2) A ^ (((y9i A V?2)^^) 


Vx(p (p[x/t], where t is substitutable for x in the sense discussed below and p}[x/t] is 
the result of substituting t for all free occurrences of a: in (p (see jEnderton 1972[[ for a 
formal dehnition) 


Va:((p ^ Ip) ^ (yxp> ^ Wx'ip) 


(f ^ Wx(p if X does not occur free in (p 


X = X 

X = y ^ {ip ^ (p'), where (p is a quantiher-free and —>-free formula and (p' is obtained 
from ip by replacing zero or more occurrences of x in <^9 by y 

X = y ^ N{x = y) 
x^y^ N{x 7 ^ y) 

From ip and (p ^ ip infer p) 

From tpi (^2 infer ipi—ypj ^ p> 2 — 

From p^i pj2 infer ip—>pJi ip — >tp2- 
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It remains to explain the notion of “substitutable” in FI. Clearly we cannot substitute a 
term t for x with free variables that might be captured by some quantihers in (p; for example, 
while 'ix3y{x ^ y) is true as long as the domain has at least two elements, if we substitute 
y for x, we get ^y{y 7 ^ y), which is surely false. In the case of hrst-order logic, it suffices 


to dehne “substitutable” so as to make sure this does not happen (see [[Enderton 1972|| for 
details). However, in modal logics such as this one, we have to be a little more careful. As 
we observed in Section ^ we cannot substitute terms for universally quantihed variables in 
a modal context, since terms are not in general rigid. Thus, we require that if <^9 is a formula 
that has occurrences of —then the only terms that are substitutable for x m ip are other 
variables. 

We claim that is the weakest “natural” hrst-order extension of the KLM properties. 
The bulk of the propositional fragment of this axiom system (axioms C1-C4, Rl, and R2) 
corresponds precisely to the KLM properties. For example. Cl is just REF, C2 is AND, 
Rl is LLE, and so on. The remaining axiom (C5) captures the fact that the plausibility 
function PI is independent of the world. We could consider a more general semantics where 
the plausibility measure used depends on the world (see fFriedman and Halpern 1998], Section 


8 ]); in this case, we would drop C5. This property does not appear in ||Kraus, Lehmami 


land Magidor 1990| since they do not allow nesting of conditionals. As discussed above, the 
remaining axioms are standard properties of hrst-order modal logic. 

The system characterizes hrst-order default reasoning in this framework: 

Theorem 4.1:: is a sound and complete axiomatization of with respect to . 

Proof: The proof combines ideas from the standard Henkin-style completeness proof for 
hrst-order logic ||Enderton 1972|| with the proof of completeness for propositional conditional 
logic given in ||Friedman and Halpern 1998|| . The details can be found in the appendix. | 


5 Other approaches to first-order subjective conditional 
logic 

In the previous section we showed that is sound and complete with respect to . 

What happens if we use one of the approaches described in Section to give semantics to 
conditionals? As noted above, we can associate with each of these approach a subset of 
qualitative plausibility structures. Let ^ and be the subsets of 

'^'subf that correspond to well-founded preference orderings, preference orderings, K-rankings, 
possibility measures, and PPDs, respectively. From Theorem [4.1| , we immediately get 

Theorem 5.1:: is sound m '^subj, 'P^bj, P^ubj- 

Is complete with respect to these approaches? Even at the propositional level, it 

is well known that because k rankings and possibility measures induce plausibility measures 
that are total (rather than partial) orders, they satisfy the following additional property: 
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In addition, the plausibility measures induced by k rankings, possibility measures, and e 
semantics are easily seen to have the property that T > _L. This leads to the following 
axiom: 


C7. ^{true^false). 


In the propositional setting, these additional axioms and the basic propositional conditional 
system (i.e., C0-C5, MP, LLE, and RW) lead to sound and complete axiomatization of the 
corresponding (propositional) structures. (See ||Eriedman and Halpern 19^ , Section 8].) 
Does the same phenomenon occur in the hrst-order case? For e-semantics, it does. 

Theorem 5.2:: C^“^-^-l-C7 is a sound and complete axiomatization of with respect to 


v: 


subj ‘ 


Proof: We combine ideas from the proof of Theorem |4-1| with results from [[Friedman and 


Halpern 1998|| showing how a plausibility structure satisfying C7 can be viewed as a PPD 
structure. The details are in the appendix. | 


Although e-semantics has essentially the same expressive power in the hrst-order case as 
plausibility measures, this is not the case for the other approaches that are characterized 
by the KLM properties in the propositional case. These approaches all satisfy properties 
beyond C6, and C7. And these additional properties are ones that we would argue are 

undesirable, since they cause the lottery paradox. Recall that Lottery, the formula that rep¬ 
resents the lottery paradox, is the conjunction of two formulas: (0) \/x{true^^ Winner{x)) 
states that every individual is unlikely to win the lottery, while (^ true — >3xWinner{x) 
states that is is likely that some individual does win the lottery. We start by showing that 
Lottery is consistent in 


Example 5.3:: We define a hrst-order subjective plausibility structure PLiot = {Domiot, Wiot, Phot, T^iot) 
as follows: Domiot is a countable domain consisting of the individuals 1, 2, 3,...; Wiot consists 
of a countable number of worlds Wi,W 2 , w ^,...; Phof gives the empty set plausibility 0, each 
non-empty hnite set plausibility 1/2, and each inhnite set plausibility 1; hnally, the denota¬ 
tion of Winner in world Wi according to niot is the singleton set {di} (that is, in world Wi 
the lottery winner is individual di). It is easy to check that Winner{di)\ = W — {wi}, so 
Winner{di)\) = 1 > 1/2 = Pl([lTmner((ij)]); hence, PLiot satishes (0). On the other 
hand, l3xWinner{x)] = W, so Pliot{l^xWinner{x)]) > P\iot{l~^^x Winner{x)}); hence PLiot 
satishes (0). It is also easy to verify that Phoi is a qualitative measure, i.e., satishes A2 and 
A3. A similar construction allows us to capture a situation where birds typically hy but we 
know that Tweety does not hy. | 


What happens to the lottery paradox in the other approaches? First consider well- 
founded preferential structures, i.e., In these structures, holds if ^|J holds in 







all the preferred worlds that satisfy (p. Thus, © implies that for any domain element d, d 
is not a winner in the most preferred worlds. On the other hand, © implies that in the 
most preferred worlds, some domain element wins. Together both imply that there are no 
preferred worlds. When, in general, does an argument of this type go through? As we now 
show, it is a consequence of the following generalization of A2. 

A2*. If {Ai ; i G /} are pairwise disioint sets, A = Uip/Aj, 0 G /, and for alH G / — |0}, 
P1(A - Ai) > Pl(Ai), then Pl(Ao) > P1(A - Aq). 

Recall that A2 states that if Aq, Ai, and A 2 are disjoint, Pl(Ao U Ai) > P1(A2), and Pl(Ao U 
A 2 ) > Pl(Ai), then Pl(Ao) > Pl(AiUA 2 ). It is easy to check that for any hnite number of sets, 
a similar property follows from Al and A2 by induction. A2* asserts that a condition of this 
type holds even for an inhnite collection of sets. This is not implied by Al and A2. To see this, 
consider the plausibility model PLiot from Example b.3| . Take Aq to be empty and take Aj, 
z > 1, to be the singleton consisting of the world wt. Then 'P\iot{A—Ai) = 1 > 1/2 = Phot(Aj), 
but Phot(Ao) = 0 < 1 = Pl(Uj>oAj). Hence, A2* does not hold for plausibility structures in 
general. It does, however, hold for certain subclasses: 

Proposition 5.4:: A2^ holds in every plausibility structure in and 

Proof: See the appendix. | 

A consequence of A2* is the following axiom, called V3 by Delgrande: 

V3. if X does not occur free in ip. 

This axiom can be viewed as an inhnitary version of axiom C2 (which is essentially KLM’s 
And Rule), for (abusing notation somewhat) in a domain D, V3 essentially says: 

Ad&D{^^ij[x/d]) (cp—^ Aden ^^[x/d]). 


Proposition 5.5:: V3 is valid in all plausibility structures satisfying A^. 

Proof: See the appendix. | 

Since A2* holds in and it follows that V3 does as well. Moreover, it is easy to 

see that the axiom V3 leads to the lottery paradox: From \/x{true — Winner{x)), V3 allows 
us to conclude true — >\/x{-'Winner(x)). 

A2* does not hold in Pf”// and In fact, the inhnite lottery is consistent in these 

classes, although a somewhat unnatural model is required to express it, as the following 
example shows. 
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Example 5.6:: Consider the possibility structure Poss, vr/of), where all the 

components besides Poss are just as in the plausibility structure PLid from Example and 
Poss(r(;j) = i/{i + 1). This means that if i > j, then it is more possible that individual i 
wins than individual j. Moreover, this possibility approaches 1 as i increases. It is not hard 
to show that this possibility structure satishes formulas and (^). 

A preferential structure in the same spirit also captures the lottery paradox. Consider 
the preferential structure {Domiot, Wiot, -<, 'n'lot), where all the components besides -< are just 
as in the plausibility structure PLiot, and we have ... tcs -< ta 2 -< ^2 -< tci. Thus, again we 
have that if z > j, then it is more likely that individual i wins than individual j. (More 
precisely, the world where individual i wins is preferred to that where individual j wins.) It 
is easy to verify that this preferential structure (which is obviously not well-founded) also 
satishes Lottery. | 

Although Lottery is satishable in possibility structures and preferential structures, a slight 
variant of it is not. Consider a crooked lottery, where there is one individual who is more 
likely to win than the rest, but is still unlikely to win. To formalize this in the language, we 
add the following formula that we call Crooked'. 

3y\/x{x ^ y ^ {{Winner{x) V Winner{y)) — >Winner{y))) 

This formula states that there is an individual who is more likely to win than the rest. To see 
this, recall that {(p V 'ip)—yip implies that either Pl([v? V ip}) =T (which cannot happen here 
because of the hrst clause of Crooked) or Pl([(^]) < Pl(['(/>]). We take the crooked lottery to 
be formalized by the formula Lottery A Crooked. 

It is easy to model the crooked lottery using plausibility. Consider the structure = 
{Domiot,Wiot,Pyiot^T^iot)i which is identical to PLiot except for the plausibility measure Pl^j. 
We define Pl(oj(tai) = 3/4; Pl'i^^^Wi) = 1/2 for z > 1; Pl)oj(A) of a hnite set A is 3/4 if 
Wi G A, and 1/2 if zui ^ A; and Phoi(A) = 1 for inhnite A. It is easy to verify that PL[^^ 
satishes Crooked, taking di to be the special individual who is most likely to win (since 
Pl([lTznner((ii)]) = 3/4 > 1/2 = Pl([lTznner((ii)]) for z > 1). It is also easy to verify that 
PlL h Lottery. 

On the other hand, the crooked lottery cannot be captured in and To show 

this, we take a slight detour. 

Consider the following two properties: 

A2t. If {Ai'.ie 1} are pairwise disioint sets, A = UjprAj, 0 G /, and for all z G / — jOj, 
Pl(Ao) > P1(A,), then Pl(Ao) P1(A - Aq). 

A3* If {Aj : z G /} are sets such that Pl(Aj) = T, then Pl(UiAj)T. 

It is easy to see that A2^ is implied by A2*. Suppose that PI satishes A2* and the 
preconditions of A2h By Al we have that Pl(Ao) > Pl(Aj) implies that P1(A —A*) > Pl(Aj). 
Thus by A2* we have that Pl(Ao) > P1(A — Aq), and therefore Pl(Ao) P1(A — Aq). 
Moreover, A2'^ can hold in structures that do not satisfy A2*. 
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Proposition 5.7:: holds in every plausibility structure in and 

Proof: See the appendix. | 

A3* is an infinitary version of A3. It is easy to verify that it holds in all the approaches 
we consider, except plausibility measures and e-semantics. 

Proposition 5.8:: A3* holds in every plausibility structure inV^^^^p V^^j, andVg°^j. 

Proof: The proof is straightforward and left as an exercise to the reader. | 

A3* has elegant axiomatic consequences. 

Proposition 5.9:: The axiom 

\/xNip ^ N(\/xip) 

is sound in structures satisfying A3*. Moreover, the axiom 

Vx((p—((3a;(p) V’)) does not appear free in xf 

is sound in structures satisfying A2* and A^.0 

Finally, we show that when A2l and A3* hold, the crooked lottery is (almost) inconsistent. 


Proposition 5.10:: The formula Lottery A Crooked ^ [true — >false) is valid in structures 
satisfying A2^ and A3*. 

Proof: See the appendix. iNotice that, since A2^ and A3* are valid in Vlfffp it immediately 

follows that Lottery A Crooked is unsatishable in Vlfffp 

To summarize, the discussion in this section shows that, once we move to hrst-order logic, 
kappa-rankings, possibility structures and preferential structures satisfy extra properties over 
and above those characterized by (and C6 and C7). We identihed these properties both 
in terms of the constraints on the plausibility measures allowed by these semantics (e.g., 
conditions A2*, A2f, and A3*), and in terms of corresponding properties in the language 
(e.g., axioms and the variants of the lottery example). Our analysis leaves open the question 
of complete axiomatization of hrst-order conditional logic with respect to these classes of 
structures. 

®The latter axiom can be viewed as an infinitary version of the OR Rule (C3), just as V3 can be viewed 
as an infinitary version of the AND Rule (C2). 
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6 First-order statistical conditional logic 


In the next three section, we analyze the statistical version of first-order conditional logic in 
much the same way we did the subjective version. 

The syntax for statistical conditionals is fairly straightforward. Let $ be a first-order 
vocabulary, consisting of predicate and function symbols. (As usual, constant symbols are 
viewed as 0-ary function symbols.) Starting with atomic formulas of first-order logic, we form 
more complicated formulas by closing off under truth-functional connectives (i.e.. A, V, and 
first-order quantification, and the family of modal operators ip where X is a set 

of distinct variables.^ We denote the resulting language (We typically omit the 

$ if it is clear from context.) The intuitive reading of ip '^x V’ is that almost all of the 
X’s that satisfy ip also satisfy 'ijj. Thus, the modality binds the variables X m. ip and 
'ip, just as Vx binds the occurrences of x in \/xip. A typical formula in this language is 
3y{P{x,y) Q{.x,y)), which can be read “there is some y such that most x’s satisfying 

P{x,y) also satisfy Q{x,y)” . Note that we allow arbitrary nesting of first-order and modal 
operators. For simplicity, we assume that all variables used in formulas come from the set 

{xi,X2,X3, . . .}. 

To give semantics to £'^*“*(<1)), we use (first-order) statistical plausibility structures (over 
<h), which generalize the semantics of statistical probabilistic structures [Pacchus 1990|; 
[Halpern 199(]|| and statistical preferential structures ||Bratman 1997|| . Statistical plausibil¬ 
ity structures over $ are tuples of the form PL = {Dom, vr, PI), where Dom is a domain, tt is 
an interpretation assigning each predicate symbol and function symbol in d) a predicate or 
function of the right arity over Dom, and PI is a plausibility measure on Dom°° (a countable 
product of copies of Dom) that satisfies one restriction, described below. Note that we can 
identify Dom^ with the set of all valuations by associating a valuation v with an inhnite 
sequence (di, ^ 2 ,...) of elements in Dom, where n(xj) = di. Thus, we can view PI as defining 
a plausibility measure on the space of valuations. 

We require that PI treats all variables uniformly, in the following sense: 


• REN. If h is a finite permutation of the natural numbers (formally, h : IN ^ IN is a 
bijection such that h{n) = n for all but hnitely many elements n G IV), then Pl(A^) = 
P1(A) for all A C Dom°°, where A’^ = {{dh(i), dh{2), dh( 3 ), ■ • •) ^ <^ 2 , 4,...) G A}. 


REN assures us, for example, that if A C Dom, then P1(A x Dom°°), the plausibility of 
a valuation giving xi a value in A, is the same as the plausibility of a valuation giving X 2 
a value in A (i.e., Pl{Dom x A x Dom°°)) and, in fact, the same as the plausibility of a 
valuation given Xk a value in A, for all k. As the name suggests, REN guarantees that we 
can rename variables, so that ip ^x t/’ will be equivalent to ip[x/y] ^x[x/y\ fi^[x/y] if y does 


This syntax is borrowed from Brafman |1997 ], which in turn is based on that of [ Bacchus 199C| ; Halpern 
199C|, except that in the earlier papers, the subscript X was taken to be a sequence of variables, rather than 


a set. Since the order of the variables is irrelevant, taking it to be a set seems more natural. 
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not occur 99 or 'll), where X[x/y] is the result of replacing a; in X by ?/ (if x G X; otherwise 
X[x/y]=X). 


We may want to put a number of other restrictions on PI, to make it act like a product 
measure, as Brafman ||1997|| does. While we believe such requirements may be quite reason¬ 


able, we do not make them here, to simplify the presentation. We discuss this issue further 
in Section |. 


Given a statistical structure PL and a valuation v, we can associate with every formula 
(p a truth value in a straightforward way. For an atomic formula such as P{x, c), we have 


• {PL, v) \= P{x, c) if {v{x), vr(c)) G vr(P). 


Note that now we write 7 r(c) rather than 7r{w){c). We no longer have different worlds as we 
did in the subjective case. Thus, the issue of rigid vs. nonrigid designators does not arise in 
the statistical case. 

We again treat quantification just as we do in first-order logic, so 


• {PL, v) 1= Wxip iff {PL, v') \= If for all v' v. 


The interesting case, of course, comes in giving semantics to formulas of the form ip '^x 4’- 
In this case we have 


• {PL,v) 1= p "0 if either Pl(n' : {PL,v') \= p, v' 'y}) =T or Pl({n' : {PL,v') |= 
p Lij), v' "y}) > PKiv' : {PL, v') \= p /\ -''ij), v' n}). 

Again, we implicitly assume here that for each valuation v, vector X of variables, and formula 
p, the set of valuations {v' : {PL, v') \= p, v' ~x t’} is in JF, the domain of PI. 

As before, if (p is a sentence, then the truth of p is independent of n; thus, we write 
PL 1= p rather than PL, v \= p. 


7 Axiomatizing first-order statistical conditional logic 

We can axiomatize hrst-order statistical conditional logic in much the same way as we did 
hrst-order subjective conditional logic. Again, we restrict attention to structures where the 
plausibility measures on DomP’ are qualitative; let be the class of all such structures. 

Let consists of all generalizations of the following axioms, together with the inference 
rule MP. In the axioms, we write VXcr, where X = {xi,...,Xm}, as an abbreviation for 
Vxi... 'ixm.o'- 

CO'. All instances of valid formulas of hrst-order logic with equality 
CP. p ^x P 

C2'. {{p '^x Pi) A {p '^x P2)) ^ {p '^x Pi A P 2 ) 
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C3'. {{ipi V’) A ((^2 V’)) ((‘^i V (^ 2 ) ^x 'ip) 

C4'. {(fi ^2 A </9i ^x ^ A 932 '^x 
Rl'. WX{lpi ^ (P2) ^ {{ipi '^x ^ ^ (9^2 '^x - 0 ) 

R2'. VX(01 02) ^ ((^^ '^X 02 ^ (95 '^X 02) 

U. VX0 ^ ((^ -^x 0) 

Ren. ip '^x 0 ‘^[x/y] '^xix/y] 0[a^/2/], if y does not occur in p or 0. 


As the notation suggests, Cl'-C4', Rl', and R2' are the obvious analogues C1-C4, Rl, 
and R2, except that Rl' and R2' are now axioms rather than inference rules. CO' subsumes 
CO and F1-F5 in Note that we no longer need a special notion of substitutivity; there 

is only one world, and there are no concerns regarding the substitution of nonrigid terms 
into a modal context. For similar reasons, there is no analogue of F6 and F7 here. Ren 
and U are analogues of similar axioms for statistical probabilistic structures [[Bacchus 199U 
Balpern IQQOj]; here we need to require REN to ensure that Ren holds. 


Theorem 7.1:: is a sound and complete axiomatization of 


with respect to Vftat^. 


Proof: The basic idea similar to that of the proof of Theorem |4.1| ; indeed, the proof is even 
simpler. See the appendix for details. I 


8 Other approaches to first-order statistical conditional 
logic 


We have already remarked that we can construct “statistical” hrst-order analogues of all the 
approaches considered in the propositional case. We omit the formal dehnitions here. Let 
'Psiat^'Pstat^'Pstat^'Pstat, and Vltat the subsets of vftat^ that correspond to well-founded 
preference orderings, preference orderings, K-rankings, possibility measures, and PPDs, re¬ 
spectively. The results are similar to those in Section |^, so we just sketch them here. 

With K-rankings and possibility measures, we need to require the obvious analogues of 
C6 and C7, namely 


C6'. {p '^x 0) A “i((p '^x “'0 ^ 7^ A ^ '^x 0 
C7' -^{truc'^x false) 


As we would expect, e-semantics satishes C7' but not necessarily C6'.[^ 


We have the following analogue of Theorem 


^^Brafman 1 1997 | discusses pointed PPDs, in which all the relevant limits are guaranteed to exist; for 
pointed PPDs, C6' holds as well. 
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Theorem 8.1:: C^*“*-/-C7' is a sound and complete axiomatization of with respect to 
-pc 

' stat ’ 


Proof: Follows from the proof of Theorem 
prove Theorem 


M using the same techniques as those used to 


from Theorem 4.1. We omit further details here. I 


Statistical plausibility structures based on well-founded preferential structures and k- 
rankings also satisfy the following analogue of V3: 


V3'. \/y{(p V') if y does not occur free in ip or in X. 


Interestingly, Brafman ||1997|| shows that together with C6', C7', and V3' is complete 
with respect to totally-ordered well-founded preferential structures. Q These are essentially 
identical to the structures generated by K-rankings. Thus, we have the following result. 


Brafman 199711 C 


Theorem 8.2 

zation of with respect to 


'^stat 


stat ‘ 


+ {C6', C7', V3'} is a sound and complete axiomati- 


In light of Brafman’s result, it seems likely that C®*“*-|-{C6', V3'} is a sound and complete 
axiomatization of with respect to although we have not checked details. 

Just as in the subjective case, V3' is not valid in statistical possibility structures or (non¬ 
well-founded) preferential structures, but a variant of the crooked lottery example does give 
us a valid formula for these structures too that does not follow from C*“-l- {C6', C7'}. 

Up to now, we have put minimal structure on the plausibility measure on DomX. In 
the case of statistical probability structures, the probability measure was assumed to be the 
product measure induced by a probability measure on Dom. We can make an analogous 
assumption in the case of e-semantics, possibility measures, and K-rankings. For example, if 
we start with a possibility measure Foss on Dom, we can dehne Poss°° on Dom°° by taking 
Poss°°((ii, ^ 2 , • • •) = infj Poss(di), and taking Poss°°(74) = supPoss°°(cT) for A C Dom°°. 
A similar construction works for K-rankings, except inf is replaced by -|- and sup is replaced 
by min. We get extra properties if we assume such a product measure construction, although 
the exact properties depend on the underlying notion of likelihood that we start with. For 
example, one property we get in all cases is the following: 


• If A, A' C Dom^ and B C Dom"^, y = (|/i,..., ?/„), z = (zi,..., Zm), y and z are dis¬ 
joint, Pl({u : (u(|/i),..., v{yn)) e A}) < PI({u : ( 0 ( 2 / 1 ),..., v{yn)) G A'}) then Pl({o : 
{v{yi),... ,v{yn),v{zi),... ,v{zm)) e AxD}) < Pl({o : ( 0 ( 2 / 1 ),...,o( 2 /„),o(zi),..., 0 ( 2 ;^)) G 
A' X B}). 

Actually, there are a number of minor differences between the framework we have presented and that 
of Brafman. For example, Brafman assumes that there is a separate order defined on Domh, for each finite 
n, rather than one order defined on Dom°°. The two approaches are essentially equivalent—we could have 
used either one here. The connection to valuations is perhaps clearer when we consider Dom°°. He also has 
the axiom {ip '^x tp) {3x(p ^ 3x{(p A ip)) instead of C7'. It is not hard to show that these axioms are 
equivalent in the presence of all the other axioms. 
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This property is captured by the axiom 

(p 'ip ^ (f A (f' 

where the set of variables free in ip' is disjoint from the set of variables free in p A'lp. 

Whether or not we assume that PI is generated as a product measure somehow, once we 
have V3' as an axiom (or the closely related variant as in the crooked lottery example), we 
get the problems in the statistical case similar to those we saw in the subjective case. For 
example, suppose V3' is valid. Consider the statement 

'iy{true^x -^Married{x,y)). 


This states that for any individual y, most individuals are not married to y. This seems 
reasonable since each y is married to at most one individual, which clearly constitutes a 
small fraction of the population. V3' then gives us 


true yy^Married{x, y). 


That is, most people are not married! This certainly does not seem to be a reasonable 
conclusion. 

It is straightforward to construct similar examples for the statistical variants of the other 
approaches, again, with the exception of plausibility structures and e-semantics. We note 
that these problems occur for precisely the same reasons they occur in the subjective case. 
In particular, V3' holds whenever the plausibility measure on Dorrf^ satishes A2*. 

This shows that, just as for the subjective case, we need the greater generality of plau¬ 
sibility measures and e-semantics to correctly model hrst-order statistical reasoning about 
conditionals. 


We observe that problems similar to the lottery paradox occur in the approach of Lehmann 
and Magidor ||199CI|| , which can be viewed as a hybrid of subjective and statistical condition¬ 
als based on on preferential structures. More precisely, rather than putting a preferential 
ordering on worlds or on valuations, they put an ordering on world-valuation pairs. While 
this greater flexibility allows them to avoid some problems associated with putting an order 
solely on worlds or on valuations, the fundamental difficulty still remains. 

Finally, we observe that the approach of l^chlechta 1995|| , which is based on a novel 
representation of “large” subsets, is in the spirit of our notion of statistical defaults (although 
his language is somewhat less expressive than ours). 


9 Discussion 

We have considered a number of different approaches to ascribing semantics to both a subjec¬ 
tive and statistical first-order logic of conditionals in a number of ways. Our analysis shows 
that, once we move to the first-order case, significant differences arise between approaches 
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that were shown to be equivalent in the propositional case. This vindicates the intuition 
that there are significant differences between these approaches, which the propositional lan¬ 
guage is simply too weak to capture. The analysis also supports our choice of plausibility 
structures as the semantics for first-order conditional logic; it shows that, with the exception 
of e-semantics, all the previous approaches have significant shortcomings, which manifest 
themselves in lottery-paradox type situations. Plausibility also lets us home in on what 
properties of an approach give us lead to an infinitary AND rule like V3. 

What does all this say about default reasoning? As we have argued, statements like “birds 
typically fly” should perhaps be thought of as statistical statements, and should thus be 
represented as Bird{x) Fly[x). Such a representation gives us a logic of defaults, in which 
statements such as “birds typically fly” and “birds typically do not fly” are inconsistent, as 
we would expect. 


Of course, what we really want to do with such typicality statements is to draw de¬ 
fault conclusions about individuals. Suppose we believe such a typicality statement. What 
other beliefs should follow? In general, \/x{Bird{x) — >Fly{x)) does not follow; we should 
not necessarily believe that all birds are likely to fly. We may well know that Tacky the 
penguin fLester 1988|| does not fly . As long as Tacky is a rigid designator, this is simply 
inconsistent with believing that all birds are likely to fly. In the absence of information 
about any particular bird, \/x{Bird(x)^Fly(x)) may well be a reasonable belief to hold. 
Moreover, no matter what we know about exceptional birds, it seems reasonable to believe 
truc'^x {Bird{x) — >Fly{x))\ almost all birds are likely to fly (assuming we have a logic that 
allows the obvious combination of statistical and subjective plausibility). 

Unfortunately, we do not have a general approach that will let us go from believing that 
birds typically fly to believing that almost all birds are likely to fly. Nor do we have an 
approach that allows us to conclude that Tweety is likely to fly given that birds typically 
fly and Tweety is a bird (and that we know nothing else about Tweety). These issues were 
addressed in the first-order setting by both Lehmann and Magidor ||1990 | and Delgrande 
198811 . The key feature of their approaches, as well as other propositional approaches rests 
upon getting a suitable notion of irrelevance. While we also do not have a general solution 
to the problem of irrelevance, we believe that plausibility structures give us the tools to 
study it in an abstract setting. We suspect that many of the intuitions behind probabilistic 
approaches that allow us to cope with irrelevance [[Bacchus, Grove, Halpern, and Koller 1996| ; 
[Koller and Halpern 1996(| can also be brought to bear here. We hope to return to this issue 
in future work. 


A Proofs 


Theorem |4.1J: is a sound and complete axiomatization of respect to 


subj 

Proof: A formula (p is said to be consistent with if 1/ -xp. A finite set of formulas 
{(Ti,..., (Tfc} is consistent with if their conjunction cti A... Act?; is consistent with 
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An infinite set S of formulas is consistent with if every finite subset of S is consistent 
with Finally, a set S is said to be a maximal consistent set of sentences if (1) it 

consists only of sentences (recall that a sentence is a formula with no free variables), ( 2 ) it 
is consistent and (3) no strict superset of S consisting only of sentences is consistent. In 
the discussion below, all maximal consistent sets are maximal consistent sets of sentences; 
however, the other consistent sets we construct may include formulas that are not sentences. 


Our goal is to show that a formula ip is consistent with iff it is satisfiable in a 

hrst-order plausibility structure. As usual, this clearly suffices to prove completeness. We 
can also assume without loss of generality that is a sentence, for using standard arguments 
of hrst order logic (see jEnderton 1972| , p. 109]) we can show that if yi,... ,ym are the free 
variables in 93 , then ip is provable iff its universal closure Vyi... 'iym.P is provable, and hence 
p is consistent iff 3yi... 3ymP is consistent. 

Let C be a countable set of constant symbols not in <F, let <F' consist of the symbols in 
that actually appear in p, and let U C.^ As usual in Henkin-style completeness 

proofs, we construct a structure satisfying p using maximal consistent subsets of (in 

the language ($+)). 

A maximal consistent subset A of is said to be C-good if (1) -Nxip G A implies 
-^'ip[x/c\ G A for some c G C and (2) Vx'0 G A implies '^[x/c] G A for all c G C. Note that 
property ( 2 ) holds automatically for maximal consistent sets in hrst-order logic, but does 
not hold in general in our logic, because of our restriction on FI. Intuitively, A is C-good if 
the constants in C are rigid designators such that every domain element is the interpretation 
of some constant in C. 


The proof now proceeds according to the following steps: 


1. We show that there is a C-good maximal consistent set C* that includes p. This follows 
closely the standard Henkin-style completeness proof for hrst-order logic [Enderton 
197211 . 


2. We construct a structure PL by using the formulas in C*. This step uses techniques 
from ipilnderton 1972|| for dehning the domain, and from [[Friedman and Halpern 1998| 


for dehning the set of possible worlds and the plausibility measure over them 

3. We show that PI |= p. Again, this argument is in the spirit of the standard Henkin-style 
completeness for hrst-order logic. 


For the hrst step, we proceed as follows. Let uo, cri,... be an enumeration of the formulas 
in the language C'^“^-^(<F+). We inductively construct a sequence Ao,Ai,... of hnite sets 
of formulas such that An is consistent. Let Aq = {p}. Let Ak+i consist of Ak together 
with the formula -Nxak+i ^ -icTfc+i[x/c], where c is a constant in C that does not appear 
in any of the formulas in Ak or in ak+i. (This is possible since Ak is hnite.) Intuitively, 


^^This pro of would go through without change if we took <&+ to be $ U C. However, for the proof of 
Theorem 5.2, it is useful to restriction attention to a language that is guaranteed to be countable. 
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-Nxak +1 ^ ^CFk+i[x!c\ says that c provides a witness to the fact that ^au+i does not hold 
for all X, if such a witness is necessary. 

We claim that A* = VJkAk is consistent. This follows from the following somewhat more 
general lemma. 

Lemma A.l:: If Bq is a finite consistent set of formulas, and for k > 0, Bk+i = Bk U 
{-Nxa ^ ct[x/c]} for some formula a and constant c that does not appear in Bk or a, then 
UkBk is consistent. 


Proof: From the dehnition of consistency, it clearly suffices to prove that Bk is consistent 
for all k > 0. We do this by induction on k. By assumption Bq is consistent. Suppose Bk is 
consistent but Bk+i is not. Suppose Bk+i = BkU {-iVxcx -icr[x/c]}. Identifying Bk with 
the conjunction of formulas in Bk, it then follows that 

h ^ (^VxcT A a[x/c]). 


Since c does not appear in any of the formulas in Bk or a, a standard argument from 
hrst-order logic (see |pi)nderton 1972|, p. 116]) can be used to show that 


h Sfc ^ (^Vxa A Vxa), 


contradicting the consistency oi Bk- I 

Let B* consist of all the formulas in A* together with all the formulas of the form 
Vxcr ^ (t[x/c], where a is a formula in and c G C. We claim that B* is consistent. 
For suppose not. Then there exists a hnite set of formulas in A*, say A', and a hnite set of 
formulas B' C and a hnite set of constants C' C C such that 

Qsub] p ^ ((A<^6S'Vxcr) A (V^eB/,ceC'^o-[a:/c])). 


Suppose C = {ci,...,Cfc}. Let yi,...,yk be fresh variables, that do not appear in the 
formulas in A' or B'. Let A" and B" be the result of replacing all occurrences of Ci,..., 
in the formulas in A' and B', respectively, by the variables yi,... ,yk. Again, using standard 
techniques jEnderton 1972|, p. 116], we have 


Qsubj p ^ ((A^g^//Vxcr) A ,y(^{yi,...,yt,}^(y[x/y])). 


It follows from FI that (Aj^gs'/Vxcr) A iyaeB",y&{yT_,...,yt,}^(^[x/y]) is inconsistent. Thus, A" 
must be inconsistent. But it follows from Lemma |A.1| that A" is consistent. This gives us 
the desired contradiction. 


Let consist of all the sentences in B*. Using standard techniques, we can extend 
to a maximal consistent set of sentences: We construct a sequence Co,Ci,... of consistent 
sets of sentences by taking Cq = and Ck+i = Cfc U {cr^} if Ck U {cr^.} is consistent and 
cTfc is a sentence, and Ck+i = Ck otherwise. Let C* = C* is then easily seen to 
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be a maximal consistent set of sentences. Moreover, our construction guarantees that C* is 
C-good and contains ip. This completes Step 1 of the proof. 

We now proceed to the second step of the proof, where we construct a hrst-order sub¬ 

jective plausibility structure based on C*. First, however, we need two more dehnitions in 
order to allow us to characterize the domain and the set of possible worlds in our desired 
plausibility structure. 

• We dehne an equivalence relation ~ on C by dehning c ~ c' if c = c' G C*. Let 

[c] = {c' : c ~ c}. As we shall see, these equivalence classes will be the domain 

elements in our structure. 

• If A is a set of formulas, dehne A/N = {'ijj : N'lp G A}. The worlds in our structure 
will be all C-good maximal consistent sets A of sentences such that A/N = C*/N. 

We want to ensure that global properties, such as equality of domain elements and condi¬ 
tional statements, are true in all worlds in the structure. As we shall see, our construction is 
such that formulas in C*/A^ are true in all these worlds. Thus, we need the following lemma. 

Lemma A.2:: Ifi/ is of the form x/' — >'ip" or c = c', for c, c' G C and A is a C-good maximal 
consistent set, ijj ^ A iff ^jJ ^ A/N. 

Proof: Suppose A is a C-good maximal consistent set of formulas and is of the form 
If G A, then Nfj G A by C5, so E A/N. Conversely, if E A/N, then 
Nip E A. Suppose, by way of contradiction, that ip ^ A. Since A is a maximal consistent 
set, we must have ->'0 G A. By C5, it follows that N^ip E A. That is, both ip — >false and 
-<'ip — >false are in A. By C3, it follows that true — >false E A. By RW, we have that both 
true —and true — ^tp” are in A. By C4, we have that ip = ip' — >ip" E A, contradicting our 
assumption. 

Now suppose Ip is of the form c = c'. By F6, we have that \/x, y{x = y ^ N{x = y)) E A. 
Since A is C-good, it follows that c = c' ^ N{c = c') E A. Similarly, by F7, we have that 
c 7 ^ c' ^ A(c 7 ^ c') G A. We can now show that -0 G A iff G A/N just as we did in the 
case of ip'^ip", replacing the use of C5 by these consequences of F6 and F7. | 

We construct a hrst-order subjective plausibility structure PL = {Dom, W, PI, vr) as fol¬ 
lows: 


• Dom = {[c] : c G C} 

• W = {w ■. w is a. C-good maximal consistent set of sentences and w/N = C*PBox 
(i.e., Nip G w iff Nip E C*) } 

• TT is dehned so that 7r(tc)(c) = [c] for the constants c G C and 7t{w) for the symbols in 

is determined by the atomic sentences in w in the obvious way (see ||Enderton 1972|, 
p. 131]). For example, if P is a binary predicate, then ([c], [c'j) G 7i{w){P) iff P(c,c') 
is one of the formulas in w. 
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PI is defined so that for all formulas and 'ip', we have P1([V’]) < P1(['0^]) iff {pJ V 
'ijj')^7p' G C*, where [a] = [w : a & w}. 


In their completeness proof for propositional conditional logic, Friedman and Halpern ||1998 
dehne a plausibility measure PI in a similar way and show that it is well dehned (i.e., if 
[pj] = [pj'] and [cr] = [ex'], then (-0 V a) — >a G C* iff (0' V a') — >a' G C*) and that it satishes 
Al, A2, and A3, so that PI is a qualitative plausibility measure. (See the proof of Theorem 
8.2 in [[Friedman and Halpern 1998|| .) The same proof applies here without change, so we do 
not repeat it. 

Finally, we move to the third and last step of the proof: showing that PL satisfy ip. To 
do so we prove the standard truth lemma, namely 


(*) {PL, tc) 1= 0 iff 0 G w, for all w G W. 


We prove (*) by a straightforward induction on the depth of nesting of —^ in -0, with a 
subinduction on structure. 


If 0 is an atomic formula of the form c = c', this follows from Lemma |A.2| and the 
dehnition of tt. For other atomic formulas, this is immediate from the dehnition of vr. 


If 0 is a conjunction or a negation, it is immediate from the induction hypothesis. 
If 0 has the form 0'—^0", then we have 


iff 

iff 

iff 

iff 

iff 

iff 

iff 

iff 


{PL,w) h 0'^0" 

P1([0' A 0"]) > P1([0' A ^0"]) or P1([0']) = T 
P1([0' A 0"]) > P1([0' A ^0"]) or P1([0']) = T 
((0' A 0") V (0' A -10"))—>(0' A 0") G C* or false G C* 
G C* or 'ip'^false G C* 

0'—>0" G C* 

Ar(0'^0") G C* 

N {ijj' G w 

0'—>0" G w 


[induction hypothesis] 
[by dehnition of PI] 
[by LLE,C1,C2 
[by RW] 
[by Lemma |A.2|| 
[since ru G IF] 
[by Lemma |A.2|| 


Finally, if 0 has the form Va;0', then we have 



{PL,w 

) = Va;0 


ih 

{PL, w 

) = 0[a;/c] for all c G C 


ih 

0[a;/c] 

G w for all c G C 

[induction hypothesis] 

ih 

Va;0 G 

w 

[since w is C-good] 


This completes the proof of (*). It now follows that {PL,C*) \= ip. We can easily get 
from this a structure over the vocabulary $ that satishes the formula (p: we simply dehne tt 
in an arbitrary way for the symbols in $ — and ignore the interpretation of the symbols 
in C. This completes the proof of the theorem. | 
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Theorem |5.2|: is a sound and complete axiomatization of with respect to 


V 


subj' 


Proof: As in the proof of Theorem |4.1j , it suffices to show that if (p is consistent with 
Qsubj^ then it is satishable in a structure in The hrst steps in the proof mimic 

those of the proof of Theorem E3- We dehne = $' U C as in the proof of Theorem m 
We then construct a plausibility structure PL = {D, W, PI, tt) satisfying ip by considering 
maximal (C^“^-^+C7)-consistent sets of sentences. It is easy to see that the structure PL is 
what is called in [[Friedman and Halpern 1998|| (following [[Lewis 1973|| ) normal, we must 
have Pl(iy) > T (otherwise C7 would not be valid in PL). By Theorem 6.3 in [[Friedman and 
[Halpern 1998[[ , it follows that there is a PPD PP on W that satishes the same defaults. More 
precisely, if PLpp = {D, W, Plpp, tt), where Plpp is the plausibility measure corresponding to 
PP, as described in Section^, then we have {PL,w) \= iff {PLpp,w) \= -0—A 

straightforward induction on the structure of formulas now shows that PL and PLpp agree 
on all sentences in ($■•■). This gives us the desired PPD structure satisfying ip, and 
completes the proof. | 


Proposition 


1 ^: 


holds in every plausibility structure in Vgffj and 


Proof: We start with Suppose PL = {D, W, PU, vr) G where k is the ranking to 

which PIk corresponds. Since lower ranks correspond to greater plausibility, we have k{A) < 
k{B) iff P1 k(A) > P1k(P). Let {Ai : i G /} be a collection of pairwise disjoint sets such that 
k{A — Ai) < ^(Aj) for alH G / — {0}. We claim that (1) k(A) = ^{Aq), (2) k{A) < «:(Aj) for 
i & I — {0}, and (3) k(Ao) < K(Uig 7 _{o}Aj). (2) follows immediately from the assumption 
that k{A — Ai) < K{Ai), since k{A) < k{A — Ai). ( 1 ) follows from ( 2 ) and the observations 
that (a) k{A) = miujg/ ^(Aj) and (b) the range of k is the natural numbers. Finally, (3) 
follows from (1) and (2) and the observation that K(Ujg/_{o}Aj) = minjg/_{o} ^(Aj). From 
(1), ( 2 ), and (3), it is immediate that k(Ao) < k(A — Aq). 

The argument in the case of Vgffj is similar in spirit. Suppose PL = (D, IT, Pl^, tt) G 
where Pl_< is constructed from a partial order -< on IT as described in Section]^. Again, 
let {Ai : i G /} be a collection of pairwise disjoint sets such that P1-<(A — Aj) < P1^(71.) 
for alH G / — {0}. Recall that Pl^(A) < Pl^(i?) if and only if for all w G A — R, there 
is a world w' E B such that w' -< w and there is no w" E A — B such that w" -< w'. 
Thus, to show that Pl^(Ao) > P1-<(A — Aq), we must show that if w G A — Aq, then there 
exists some w' E Aq such that w' -< w and there is no w" E A — Aq such that w” -< w'. 
Suppose not. Then we construct an inhnite decreasing sequence .. .Wk P Wk-i -<■■■-< Wq, 
contradicting the assumption that -< is well founded. We proceed as follows. Let wq = w. 
Suppose inductively we have constructed wq, ... ,Wk. If Wk E Aq, by assumption, there is 
some Wk+i G A — Aq such that Wk+i -< Wk. If Wk E A — Aq, then Wk E Ai for some i 7 ^ 0. 


^^Theorem 6.3 in [Friedman and Halpern 199S[ applies only to countable languages, so it is important that 
we use $+ here, rather than $ U C, which may not be countable. 
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Since Pl^(^ — A) > Pl^(Aj), it follows from the construction of Pl_< that there must be 
some Wk+i & A — Ai such that Wk+i -< Wk- This completes the inductive proof, and gives us 
the desired contradiction. It is easy to see that because Aq and A — Aq are disjoint, the fact 
that Pl^(y4o) > Pl^(^ — ^o) implies that Pl^(y4o) > Pl-<(^ — ^o), as desired. I 


Proposition 


0|: 


V3 is valid in all plausibility structures satisfying A2*. 


Proof: Suppose PL = (D, fP, PI, vr) satishes A2* and (PL, ta, v) \= Va:((p—>-0), where x does 
not appear free in (p. It follows that 

[(p A fjlpij ^ > [(p A ^ for all valuations v' such that v' v- (6) 

Let A = [v7] PI ^ (Note that we have A = ip}pi,^yi for all v' such that v' v, since x 
is not free in (p.) For each d & D, let A^ = {p A where Vd v and Vd{x) = d. 

Let A' = \p A Vx'i/']. Note that A = A' U {Udf^oAd) and that A' is disjoint from Ad, for each 
d E D. The sets Ad are not necessarily disjoint. Thus, let Bd be such that Bd C Ad, the 
sets Bd are pairwise disjoint, and Ud^nBd = Ud^nAd- (We can always hnd such sets Bd- If 
D is countable, say D = {1, 2, 3,...} without loss of generality, then we can take Pi = Ai 
and Pfc+i = Ak+i — (Pi U .. .Pfc). If D is uncountable, we must hrst well-order P; then a 
similar inductive construction works.) Thus, we have A = A' U (U^eD-Bd), and all the sets 
on the right-hand side are pairwise disjoint. 

From (^), it follows that P1(A — Ad) > Pl(Ad), for all d E D, so clearly P1(A — Bd) > 
Pl(Pd), since Bd C Ad. From A2*, it follows that Pl(A') > P1(A — A'). Thus, {PL,w,v) |= 
p —as desired. I 


Proposition 


5.7: 


A2^ holds in every plausibility structure in and 


Proof: We start with Suppose that POSS = (P, W, Poss, vr) is a possibility structure. 

Let {Ai : i E 1} he a. collection of pairwise disjoint sets such that Poss(Ao) > Poss(Aj) for 
all z G / — {0}. This implies that Poss(Ao) > supjgi-_|o} Poss(Aj) = Poss(A — Aq). We 
immediately get that Poss(Ao) -fi Poss(A — Aq). 

The argument in the case of is similar in spirit, although somewhat more involved. 
Suppose PL = (P, W, Pl^, vr) G Pf^bj, where Pl_< is constructed from a partial order -< on 
W as described in Section Again, let {Ai : z G /} be a collection of pairwise disjoint sets 
such that Pl^(Ao) > Pl^(Aj) for all z G / — {0}. 

By way of contradiction, suppose that Pl^(Ao) < P1^(A — Aq). Let w G Aq. Since 
Pl^(Ao) < P1^(A — Ao), there is a world w' E A — Aq such that w' -< w and there is no 
w" E Aq such that w" -< w' . Let Aj be the set that contains w' . (There must be such an index, 
since A — Aq is the union of such sets.) Since Pl^(Aj) < Pl^(Ao), there is a world w” G Aq 
such that w” -< w'. Thus, we get a contradiction. We conclude that Pl^(Ao) -fi P1-<(A — Aq) 
I 
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Proposition 


5.9: 


The axiom 


yxNif N{'ix(p) 

is sound in structures satisfying A3*. Moreover, the axiom 

Wx^ip^fj) ^ {(f3iXip) ' 0)5 'i'f ^ does not appear free in fj 
is sound in structures satisfying A2* and A3*. 

Proof: For the first part of the proposition, suppose that PL = {D,W, PI, vr) be a plausibility 
structure satisfying A3*. Assume that there is a world w G W and a valuation v such that 


{PL, w, v) 1 = 'ixNtp. 

This means that if V, then 

{PL,w,v')^Nif (7) 

For each d E D, let Vd be the valuation such that Vd v and Vd{x) = d. Let Ad = {w' : 
{PL,w',Vd) \= 9?}. From (^, we have that Pl(hF — Ad) = T for all d E D. Using A3*, we 
get that PL{W — (flrfA^i)) = T. Thus, 

{PL,w,v) \= NWxip 


as desired. 

For the second part of the proposition, suppose that PL = {D,W, PI, tt) be a plausibility 
structure satisfying A2* and A3*. Assume that there is a world w E W and a valuation v 
such that 

{PL,w,v) \= Mxiip^fj), 

where x does not appear free in (p. This means that if v' ^x v, then 

{PL,w,v') \= (8) 

Again, for each d E D, let be the valuation such that Vd v and Vd{x) = d. Let 
Ad = {w' : {PL,w' ,Vd) \= ’p} and let B = {w' : {PL,w',v) |= -0}. 

If Pl(Ad) = T for all d, then by A3* we get that Pl(UdAd) = T. In this case {PL, w, v) \= 
(Vx(p)—vacuously. 

On the other hand, if Pl(Ad) 7 ^ T for all d E D, then we must have Pl(Urfg£)Ad) > T. 
By (H), for each d' E D, we have that either Pl(Arf) = T or Pl(Arf fl 5) > Pl(Arf — B). (Note 
that since x is not free in - 0 , we have that {PL,w',v) |= -0 iff {PL,w',Vd) \= V'-) either 
case, we can conclude that 

P\{{AdAd)nB)>P\{Ad>-B). 

Let AL^ be pairwise disjoint sets such that A'^ C Ad, and = Ad. (Such sets must exist; 
see the proof of Proposition [^ .) Thus, we have that Pl((UdAd) fl S) > P1(A(^ — B). Using 
A2*, we get that 

P1((U,A,) n 5) > P\{UdA', -B)= Pl(UdA, - B). 
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We conclude that 


{PL,w,v) h 


as desired. I 


Proposition 5.10: The formula Lottery A Crooked {true — >false) true in structures 


satisfying A2'^ and A3*. 

Proof: Let PL = (H, W, PI, vr) be a plausibility structure satisfying A2l and A3*. Suppose 
also that PL |= Lottery A Crooked. Since 


PL 1= 3y\/x{x ^ y ^ {{Winner{x) V Winner{y)) — >Winner{y))), 
there must be some domain element d^ & D and valuation v such that v{y) = do cind 
{PL,v) 1= Va;(a; ^y ^ {{Winner{x) V Winner{y)) — >Winner{y))). 


This in turn means that if V, then 


{PL,w,v') 1= (lTmner(a;) V Winner{y))^Winner{y). 


( 9 ) 


For each d & D, Vd be such that Vd x and Vd{x) = d. Let Ad = {w' G W : {PL,w,Vd \= 
Winner(x)}, let A = UdAd, and let B = W — A. 

It is immediate from (|^) that either 

(a) Pl(ArfJ = Pl(Ado - Ad) = -L for all d ^ do, or 

(b) PL{Ado) > PL{Ad - Ado) all d^do. 

Assume that (a) is true. By A3*, we get that Pl(UdArf) = T. Since PL |= true^3x Winner{x), 
we get that either Pl(lF) = T or Pl(UdArf) > P1(1F — (UdAd)). Since the latter inequality is 
inconsistent, we conclude that Pl(lF) = T and, thus, PL \= true^false, as desired. 

Now assume that (b) is true. From A2l, it follows that 


Pl{Ado)^Pl{A-Ado) 


( 10 ) 


Since PL \= \/x{true — >^Winner{x)), we have that 

{PL,Vdo) H —^-1 lFmner(a;) 



( 11 ) 


(12) 
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Using A2, ( 0 ). and O, we get that P1(A — > Pl(Arf(j U -B) > Pl(A£i(,). This, however, 

contradicts (pTl), showing that (b) is impossible. | 


Theorem |7.1|: is a sound and complete axiomatization of with respect to Vftat^. 


Proof: We proceed much as in the proof of Theorem |4.1|, using the same notation. Again 
we can assume without loss of generality that is a sentence. Using standard arguments of 
hrst-order logic, we can show that there is a C-good maximal consistent set of sentences C* 
that includes ip. (The second property of C-goodness, that Wx'if G C* implies 'ip[x/c] G C*, 
now follows immediately from the axioms, since we can substitute constants in arbitrary 
contexts, and the proof of the hrst property is just the standard hrst-order proof, again 
because we get to use all the standard hrst-order axioms with no change.) 

We construct a hrst-order statistical plausibility structure PL = {Dom, PI, vr) by again 
taking Dom = {[c] : c G C} and dehning vr so that 7r(c) = [c] for the constants c G C and vr 
for the symbols in <h' is determined by the atomic sentences in C* in the obvious way. The 
dehnition of PI is also similar in spirit to that in the proof of Theorem O. We take the 
domain of PI to be all the dehnable subsets of Dom°°. More precisely, given a formula (p, let 
(fv be the formula that results by replacing all free occurrences of Xi in (p by some constant 
in the equivalence class v^xf). (It doesn’t matter which one.) Let = {n : (p.y G C*}. 
The dehnable subsets are precisely those of the form for some formula ip. Note that the 
dehnable subsets do form an algebra, since U = A^vp A^ = A-,^. We dehne PI 
on this algebra by taking Pl(A^) < Pl(Ap) ih <^9 V "0 ip & C*, where X consists of all the 
variables free in ipW fj. We leave it to the reader to check that for every sentence ip G 
we have PL ^ ^p iS ^p e C*. | 
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